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^ A series of tether-linked bis netropsins have been synthesized in order to assess the phasing 

^ problem, which arises because of the lack of dimensional correspondence between oligopeptides 

^ and oligonucleotides in DNA binding characteristics. The consequences of incorporating var- 

= iable-length flexible and rigid tethers (poly(methylene), Z and E ethylene, m- and p-phenylene] 

^ between the two netropsin-like moieties on the DNA binding properties were assessed by DNase 

03 I footprinting. The conformational freedom associated with two netropsins linked by a flexible 

I""" methylene tether allows ligand binding in both a mono- and bidentate fashion, with bidentate 

HI binding requiring a minimum linker length of (CH 2 ) 3 . For compounds possessing rigid tethers, 
for example, cis and trans ethylene moieties, the cis geometry excludes bidentate ligation while 

p the trans structure favors it. Bis netropsins possessing aryl linki ng groups have reduced DNA 

~ binding affinities. This is most plausibly due to the aryl groups, which are not coplanar with 

J* the netropsin moieties, thus blocking the ligand from penetrating deeply into the minor groove 
of DNA. 



Introduction 

A number of strategies have been used to target specific 
sites on DNA (1). For single-stranded DNA, oligo- 
nucleotides complementary to certain sequences on DNA 
have been synthesized and characterized. Certain of these 
oligonucleotides have been equipped with DNA-damaging 
activity by incorporation of chemical cleaving or alkylating 
agents into their molecular structure (2 t 3). In an effort 
to minimize attack by nucleases present in the cell and, 
thus, enhance the potential of these new agents as useful 
drugs, compounds having modified stereochemistry about 
phosphorus (4) as well as an altered configuration about 
the anomeric carbon atom of the sugar moiety (5, 6) have 
been made. In addition to single-stranded DNA, efforts 
have been made to identify ligands that attack or bind to 
double-stranded DNA. Among these are single-stranded 
oligonucleotides, which, under certain conditions, will form 
a triple helix with duplex DNA (7). Although triple-helix 
formation involves the wider major groove of DNA, the 
bulk of the small ligands studied to date attack DNA via 
its minor groove (1). 

Netropsin and distamycin are N-methylpyrrole oligo- 
peptides (Figure 1) which bind in the minor groove of DNA 
and have high specificity for AT sites. Single-crystal X-ray 
analyses of the DNA complexes of the drugs bound to 
small segments of DNA have revealed that hydrogen 
bonding and van der Waals contacts are the key features 
in the drug-DNA recognition process (9-11). Since the 
drugs are cationic and the minor groove of DNA in AT-rich 
regions bears a high negative potential (12), the electro- 
static potential surface of DNA also plays a role in spe- 
cificity. Recognition of the importance of the electrostatic 
effect led to the development of monocationic lexitropsins, 
or information-reading oligopeptides, in which individual 
DNA base recognition elements are better able to express 
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themselves since the electrostatic influences are no lc 
dominant (13-15). For example, in an attempt to mt 
the AT specificity of a netropsin-like molecule, we 
thesized and studied the DNA binding properties 
series of IV-methylimidazole oligopeptide agents (13 
By use of DNase I footprinting methods (16), certa 
these lexitropsins were found to bind in the minor gr 
of DNA at unique GC-rich sites. This observation, as 
as NMR evidence (17, 18) led to a binding media: 
involving hydrogen bonding between the N~me 
imidazole moiety and the 2-amino group of guanine lex 
in the minor groove of DNA. 

In exploring the structural elements that control 
ognition, we also studied the chiral oligopeptide antifc 
antibiotics anthelvencin A and dihydrokikumycin B 
20). While the larger tripeptide anthelvencin exhil 
little enantiomeric selectivity for DNA, the natural 
isomer of the dipeptidic dihydrokikumycin bound 1 
AT interaction sites with an affinity ~3 times greater 
its unnatural enantiomer (20). 

Additional structural elements have been identify 
being important in the molecular recognition of c 
peptides for DNA include the steric hindrance affo 
by the carboxyl-terminal methylene group in the c 
peptide antibiotics and, e.g., the guanine 2-NH 2 grot 
the minor groove of DNA (21), which is an impoi 
carrier of biological information (9). The fact that 
steric clash ensures reading of a 3'- AT terminal gro 
indicated when excision of this methylene permits a 2 
recognition and binding as demonstrated by NMR 
printing (21). 

Yet another structural design element emerges fro 
examination of the DNA sequence selectivity of this 
bearing lexitropsins. Those structures in which the thi 
nitrogen is aligned into the minor groove result in (K 
recognition analogous to that of imidazoles. In conl 
the isomeric structures, in which the larger thiazole s 
is aligned in to the minor groove, result in GC site 
dance, presumably for steric hindrance reasons (22 
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Figure L The structures of netropsin (1) and distamycin (2) are 
shown. 

The ultimate goal in developing lexitropsins, or other 
groove-selective ligands. as potential gene-control agents 
is to produce general agents capable of delivery into the 
cell and of recognizing any predetermined sequence. It 
can be shown statistically that this requires a run of 15 or 
16 base pairs that defines a unique sequence in the human 
genome (24). 

One of the central problems in targeting such relatively 
long segments of DNA using netropsin type molecules 
concerns the relationship between the repeat distance of 
a nucleotide unit of DNA and the hydrogen bond and van 
der Waals contacts generated by an N-methylpyrrole 
peptide. This problem, sometimes referred to as the 
phasing problem, was recently addressed by Goodsell and 
Dickerson (25). In model studies involving idealized B- 
DNA coordinates, these workers demonstrated that as the 
netropsin-like ligand increases in length, the hydrogen 
bond contacts and van der Waals contacts between the 
ligand and DNA become seriously out of phase with the 
spacing between the nucleotides of DNA. Thus, simply 
increasing the length of iV-methylpyrrole oligopeptide to 
gain additional degrees of specificity ultimately results in 
a severe mismatch between the peptide and DNA which 
is manifested in reduced binding affinities. This phe- 
nomenon probably explains the reduced binding affinities 
exhibited by certain poly(N-methylpyrrole) peptides 
studied by Dervan and co-workers (32). 

One way to address the phasing problem is to connect 
netropsin-like frameworks together by using a tether. This 
approach, first used by Khorlin and co-workers (27), in- 
dicated that both netropsin moieties of a tether-linked bis 
netropsin could bind to DNA. In this paper, we explore 
the consequences of incorporating variable-length flexible 
and rigid tethers between two netropsin-like molecules 
(Figure 2) on the DNA binding properties of the resulting 
structure. The DNA binding specificities of the com- 
pounds were studied by using DNase I footprinting 
methodology. 

Materials and Methods 

Melting points were determined on a Fisher-Johns apparatus 
and are uncorrected. The IR spectra were recorded on Nicoiet 
7199 FT spectrophotometer, and only the principal peaks are 
reported. The NMR spectra were recorded on Bruker WH-200 
and WH-400 spectrometers. FAB (fast atom bombardment) mass 
spectra were determined on Associated Electrical Industries (AED 
MS- 9 and MS-50 focusing high- resolution mass spectrometers. 
Kieselgel 60 (230-400 mesh) of E. Merck was used for flash 
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Figure 2. The structures of the bis netropsin ligands used ii 
study are shown. 

chromatography, and precoated sheets of silica gel 6OF2&4 
Merck were used for TLC. TLC systems were as follows: ( 
covalent peptidic compounds, chloroform / methanol 9:1; (i 
ionic compounds with one ionic pair, methanol with some A 
(iii) for ionic compounds with two ionic pairs, methanol with 
formic acid. 

Synthesis of Linked Bis Oligopeptides. The synthe 
compounds 3-11 has been reported (28). 

iV-[l-Methy l-4-[ l-methyl-4-<4-nitriiobutanamido) 
"role-2-carboxamido]-2-pyrrolyl]-i>T / -[i-methyi-2- 
n^thyl*2-[[(3-nitrilopropyl)8jniiioiearbonyl]-4-pyrrc 
amino]carbonyl]-4 pyrrolyl]maJeamide« 3-([l-Methyk 
methyl-2-ammo-4-pyrrolyl)carbamoyll-4-pyrrolyl]carban 
propionitrile (28) (158 mg, 0.5 mmol) and maleic anhydric 
mg t 0.5 mmol) were heated in acetonitrile (5 mL) at 50 0 
3 min. Another portion of 7 (158 mg) was added, and the so) 
was evaporated to dryness. The residual solid was dissolv 
DMF (2 mL), DCC (103 mg, 0.5 mmol) wis added, am 
mixture was set aside overnight at room temperature. Two< 
of water were added and the solution was filtered Then an c 
of water precipitated the crude product. It was collect* 
chromatographed on slilica gel with chloroform and 15% 1 
anol, providing yellow fractions. These were combinec 
evaporated, and the residue was recrystallized from acetc 
give the pure product as a crystalline solid: 100 mg (5&5% 5 
mp 250-2 °C; *H NMR (DMSO-dJ 5 2.74 (t, 4 H). 3.54 (q, 
3.83 and 3.88 (2 s, 12 H), 6.36 (s, 2 H), 6.84 and 6.88 (2 d, 
7.23 (d, 2 H), 7.33 (d, 2 H), 8.36 (t, 2 H), 9.97 (a, 2 H), 1L 
2 H); IR (film) 1377, 1438, 1462, 1536, 1630, 1650, 2245, 3270 
FAB-MS (m/z) 709 MH + . Anal Calcd for CaHaeNuOe: C 
H, 5.1; N, 23.7. Found: C, 58.0; H, 5.0; N, 23.9. 
iV-[4-[4-(4-Ainino-4-iminobutajiamido)-l*methylpyi 

2- carboxamido]-l-methyl-2-pyrrolyll-Ar-[2-[[[2-[[(3-ai 

3- iminopropyI)amino]carbonyl]-l-methyl-4-pyrrt 
amino Jcarbonyi]- 1-methy l-4-pymlyl]maleamide Dih 
chloride (13). The above compound (HOlng, 0.24 mm 
treated under Pinner reaction conditions as in example 3 
28. The completed reaction mixture was evaporated to dr 
and the residue dissolved in ethanoL Controlled additi 
2-propanol provided selective precipitation of impurities, 
mother liquor was evaporated, the residue was dissolv 
methanol, and precipitation with acetonitrile gave pure com) 
(13) as a crystalline solid: 166 mg (85% yield); mp 217 0 
NMR (DMSO-d 6 ) 5 2.75 (t, 4 H), 3.54 (q, 4 H), 3.82 and 3 
s, 12 H). 6.41 (s, 2 H), 6.97 (d, 2 H), 7.02 (d, 2 H), 7.23 (d, 
7.36 (d, 2 H), 8.30 (t, 2 H), 8.77 (bs, 4 H), 9.08 (bs, 4 H), 1* 
2 H), 11.34 (s, 2 H); IR (Nujol) 1377, 1463, 1528, 1581, 1637, 
3252 cm' 1 ; FAB-MS (m/z) 743 (M - HC1 - CD*. AnaL 
for C 3 4H 44 N l 40 6 Cl 2 : C, 50.1; H, 5.4; N, 24.0; CI, 8.7. Four 
49.7; H. 5.6; N, 24.3: CI, 8.9. 

N-(l-Methyl-4-[ 1-methy l-4-(4-nitrilobutanamido 
role-2«carboxamido]-2-pyrrolyl]-AT'-[l-methyI-2 
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methyl-2-[[(3.nitriIopropyl)ainino]c«rbonyl]-4-pyrn>lyn- 
Miimo]^nyl]^pyrn,lyI]fumanunide. Thh«mpoS I wi 
prepared by a similar procedure to that described above for the 

SKSSt&XSf?! in 84 - 7% y je,d « a crystalline solid: mp 
289-90 °C; «H NMR (DMSO-ay 6 2.74 (t, 4 H) 3.42 fa 4 m 3*2 
and 3.86 (2 s, 12 H), 6.95 (2 a\4 H), 7.09 (s?2 ffi 7 Z2 f dL 2 m 

55 S ^ift l , 53 ?' 4 1559 ' 1586. 1631. 1658. 1692. 22W 
n 24 ^ I? !i p AB-MS (m/z) 709 MH*. AnaL Calcd for 
gA^N, 2 0,: C. 57.6; H. 5.1; N. 23.7. Found; C. 57.9; H. 5.3; 

?^L^ I4 " ( ^^. n ^ 4 "i minobutenMido >- Im et«yipy™ie- 

2wboxamidoH.methyl-2-pyrrolyl].Ar'-t2-t[[2-[[(3-ajnino- 

nydrochlonde (12). The title compound was prepared from the 
^ e P?P/° mt ? le intermediate by the Pinner conversion (28) 
described above for the maleic isomer in 58% yield as a crystalline- 
solid: mp 295 °C; >H NMR (DMSO-d.) 6 2.65 (t, 4 H) 3 52fa 

lSAS*, 5 ro ^ n 3 ,^ ^ H), 835 (bs, 8 H), 10.04 (s. 2 H). 

jV.[i.Methyl-4.[l.methyI-4.(4-nitrilobutaiuunido)pyr- 
"»tM-*[[(3-nttnIopropyl^ 

a^o]carbonyl]-4.pyrrolyl]i8ophtbalaimde. The title com- 
P«md was prepared by a similar procedure to that described for 
™» ~? ■» d fumanc derivatives in 95% yield as a crys- 
?^ "tm mP 278-82 »C; «H NMR (DMSO-dg) i 2.76 J». 

(t 2 H) 2 * i S ( J, 2 3 7 ; 6 25 1 H >- 8 " (2 d. 2 H). JL37 

3m Sff S 5ff£ ^ ^ 1514 ' 1531 ' 1584 ' 1638 ' W 
3121, 3307 cm »; FAB-MS (m/z) 759 MH*. AnaL Calcd for 

CrffcN^O,: C 60.1; H, 5.0; N. 22.2. Found; C. 59.8; H. 5 3; 

^-I4-I4-(4-Amino-4-iminobutanamido)-I-methylpyrrole- 
2^boxam.doH.methyl.2.pyrrolyl].jy%[2-[t[2-[[(3^Ltooi 
3-iminopropyl amino]carbonyl]-l-meUyl.4-iyrrolylU 

hydracnlonde (14). The title compound was prepared from the 
abovej propionitnle intermediate by the Pinner conversion de- 
tSdV. ™, n- "^i 6 * Md fumaric add structures in 78% 
fSlf?t fS^fSTT^J^ 243-50 ° C: ' H (DMSO-d,) 
6 2.65 (t, 4 H), 3.54 (q. 4 H), 3.84 and 3.90 (ws 12 H) 6 98 fd 

IMS im ^ ^A^til 1 ? 81 - 1377 ' 1403 ' 1463 ' 1529 ' 1581 - 
PoM f n u v?n ; FAB.MS (m/z) 743 (M - HC1 - CD* AnaL 

methy . 2 .[[(3.n triIopropyl)amino car^nyI?"S^^^ 
amno]carbanyi].4.py^lyl]terepntha^ 
^mpound waa prepared by a similar procedure to that described 
for the maleic fumanc, and isophthaiic acid compounds in 99% 



yield as a crystalline solid: mp 305-10 °C; l H NMR 

o « 5 A 4 . J 5 ' ZM 4 H), 8^4 and 8.90 (2 8, 12 ! 
2 10,7.12 (d, 2 H), 7.26 (d f 2 H), 7^9 (d, 2 H), 8.08 fe 
bol^i? 01 (s - 2 10^0 (s,2H);IR (Nujol) 1263, 
1438, 1467, 1516, 1556, 1584, 1640, 225a 3295 cm* FAE 

C^St ^"J&Wi*** C,601;a5AN,2 
C, 60.0; H, 5.3; N, 22.5. 

^4-[4-(4-Amino-4-iminobutanamido)-l-meth 

2- cal^rwludo^I-methy^^pyr^>IyI^^ 

3- lmmopropyl)amino]carbonyl]-l-methyM-|: 
ajnmo]wbonyl].l.methyl^pyrrolyi]terephthal 
hydrochloride (15). The title compound was prepare 
above propiomtrile intermediate by the Pinner conv 
described above for the maleic, fumaric, and isoph 
Mwnmw^ 6470 ^ 83 a crystalline solid: mp 26 
N ^ D M5fh W 6 2 65 (t » 4 »• "* 4 ». 3 84 f 

a i n r H, !4 6 i?? ft 2 H) ' 716 (d ' 2 W ' 7 » *«. 2 ». 7.2 
8.10 (s. 4 H). 8.29 (t, 2 H), 8.74 (bs, 4 H), 9.04 (bs, < 

2 H); m (Nujol) l260 » l377 - 1403 » 1 

l Tl l632 - 1671 » 16 87- 3300 cm-H FAB-MS 

5.4; N. 22.6; CI, 8.2. Found: C, 534; it 5.6; N, 22. 

DNaae I Footprinting Studies. The 139-base-pair 
fragment from pBR322 DNA, the partial sequence o 
« *2V IX1 Figure 3 ' waa isolated as previously describe 
w-Tlie fragment was labeled at the 3 / .tenninai poeitio 
[a-^P] ATP in the presence of reverse transcriptase, 
footprinting reactions were carried out in a total volun 
containing a final concentration of 37.5 mM Tria (pH 
mMMgCl 2 ,and2mMCaCl 2 . In the reaction, 2 vL of ca 
?NA (770 mM in base pairs) was mixed with 2 mL . 
diolabeled fragment To the ArpL DNA solution was ac 
of a binding ligand at a known concentration. After 
eqinhbration period, 0.1 unit of DNase I was added and 
proceeded at 37 °C for 10 min. The final ligand cono 
l "tne various footprinting experiments ranged from 1< 
mM. DNase I digestion was terminated by the addition 
of a denaturing loading buffer (ULB) containing 0 1 1 
70% urea, and 0.025% each bromophenol blue and xylei 
One-third of the total volume, 6 vU was loaded int 
denaturing polyacrylamide gel and electrophoresed foi 
imateiy 2 h at a constant temperature of 55 °C in a then 
field gradient, electrophoresis device, Autoradiography wi 
otrt by using Kodak XAR-5 X-ray film at *20 °C. In cert 
the autoradiographic data were scanned with a previc 
scribed microdensitometer (29). Ligand concentrate 
detenmned optically through the molar extinction coefl 
by direct weighing. 

Estimation of Drng-DNA Binding Constanta. Dr 
bmdmg constants were estimated as described previou 
To 2 mL of Tris-EDTA buffer, pH a containing 12 nMt 
bromide was added calf thymus DNA to give a final conce 
of 1.35 mM. The fluorescence was measured after equ 
for a few minutes, using a Turner Model 430 spectrofluc 
iI)^Sr er " AmsC0 Instruments » Carpinteria, CA) equippe 
150-W xenon lamp, at an excitation wavelength of 525 
an emission wavelength of 600 nm. Aliquotsof concentrai 
solutions were added, and the fluorescence waa m< 
Controls were performed to show that the drugs themse 
not interfere with the fluorescence measurements at it 
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Figure 4. Autoradiographic data taken from footprinting ex- 
periments involving methylene-linked ligands 6 [(CH^a, lanes 
3-11) and 7 ((CH^, lanes 13-21], Lanel: 139-mer in the absence 
of ligand and DNase I. Lanes 2 t 12, and 22: DNAse I cleavage 
of the 139-mer in the absence of ligand. The lane numbers and 
ligand concentrations in *iM are: 3 and 13, 194; 4 and 14, 97; 5 
and 15, 48.5; 6 and 16, 24.3; 7 and 17, 11.6; 8 and 18, 5.8; 9 and 
19, 2.9; 10 and 20, 1.6; 11 and 21, 0.8. 

employed. From a plot of the' decreased fluorescence of the 
ethidium-DNA complex with increasing dose of drug, the con- 
centration of drug needed to reduce the fluorescence by 50% was 
determined and used to calculate a relative binding constant for 
the drug, given the binding constant of ethidium to be 10 7 M~ l 
under similar conditions (31). This method allows rapid com- 
parison of a series of related compounds, but the individual 
binding constants may not correspond to values obtained by using 
other methods (30). 

Results 

Ligands Possessing Methylene Linking Groups. 
Compounds 3-1 1 all possess the same basic structure: two 
netropsin-like portions connected by a methylene chain. 
The length of the chain ranges from no methylene units 
to eight methylene units. As the length of the chain in- 
creases, the DNase I inhibition regions on the 139-mer 
enlarge only slightly. Figure 3 shows a summary of the 
inhibition regions for all nine ligands as well as for that 
of netropsin deduced from the DNase I footprinting data. 

Compounds 3-5, the shortest of the ligands studied, gave 
patterns identical with that of netropsin. Areas inhibited 
from enzyme digestion were (5'-3')ATAAC (50-45), 
AATTTAAC (62-55), and ATTTCA (92-87). A fourth 
area of inhibition was located at an AT tetramer, ATAA, 
around 159-156 (22). The exact size of the region and the 
nucleotides involved were difficult to assign owing to the 
low resolution in the higher molecular weight portion of 
the autoradiogram. 

Ligands 6-11 indicated slightly larger protection regions 
involving nucleotides 51 (G), 54 (T), 86-85 (TA), and 96-63 
(TTAG). The footprinting autoradiographic data for 6 and 
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Table L Approximate Concentrations at 50% Protee 
for Ligands with Various Tethers* 



ligand 



tether 



ooncentzfitto 



3 
7 
11 
12 
13 
14 
15 



CH, 

(CH 4 ) 4 

(CH,) S 

troru-ClV-CH, 

cia-CHj^CHt 

/H-C4H4 



3 
3 
2 
2 

11 
24 



• Values were determined by visual inspection of the footp: 
autoradiographic data. 




Figures. Autoradiographic data taken from footprint 
periments involving the trans (lanes 3-11) and cis (lanes 
olefin-linked ligands, 12 and 13. The lane assignments and 
concentrations are identical with those given in Figure 

7 are shown in Figure 4. Comparative DNase < 
printing data for netropsin on this 139-base-pair rest 
fragment may be found in ref 13 and 14. Second* 
hibition patterns also occurred around the prima 
involving many AT trimers. Specifically involve* 
nucleotides 54-51 (TGTG) located between two pi 
netropsin binding sites, 68-64 (GTTAG), 86-83 (T 
and 101-93 (CATTGTTAG). The 60% protection 
as visually estimated from the autoradiographic da 
curred at —3 nM for all ligands of the series (Ta 

Ligands with Olefin Linking Groups. Mark 
ferences found between the inhibition patterns of thi 
and cis isomers, 12 and 13, respectively (13, 14). 
pound 12 showed protection patterns encompassini 
sections of the fragment. Nucleotides 62-48 < 
TAACTGTGAATA) comprised one i nhibit ion .are 
nucleotides 101-81 (CATTGTTAGATTTCATAC 
comprised the other. No secondary sites could be dc 
for 12. At the highest ligand concentrations studie 
compound exhibited only two bands on the autorad 
(Figure 5), corresponding to the single- and d 
stranded (at —100) 139-mer. 

Conversely, the cis isomer, 13, exhibited a v< 
quence-specific pattern much like that of netropa 
14). Primary inhibition regions involved nucleotides 
50-48 (ATA), 62-56 (AATTTAA), and 92-87 (AT 
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Figure 6. Autoradiographic data taken from footprinting ex- 
periments involving the aryi-linked Uganda 14 (lanes 3-11) and 
15 (lanes 13, 14, and 16-22). Except for the fact that lane 15 was 
not loaded, the lane assignments and ligand concentrations are 
identical with those given in Figure 4. 

while nucleotides 55-51 (CTGTG), 68-63 (GTTATC), and 
101-93 (CATTGTTAG) were included in secondary sites. 
The midpoint of the titration of the strongest sites oc- 
curred at ~2 fiM (Table I) for both ligands. 

Ligands Containing Aryl Linking Groups. The most 
prominent feature about the autoradiogram of 14 and 15 
(Figure 6) is the low binding affinities of the ligands for 
DNA. Compound 14, the ligand with the meta-disubsti- 
tuted aryl ring, showed 50% inhibition at -11 mM. This 
value is 4-5 times greater than the amount of ligand 
necessary for 3-13 to produce similar protection (Table 
I). Initial region s of pr otection included nucleotides 50-48 
(ATA), 62-55 (ATTTAAC), and 92-87 (ATTTCA). Sec- 
ondary sites occurred at nucleotides 86-84 (TAC) and 
96-93 (TTAT). Compound 15, which baa the para-di- 
substituted aryl ring, showed even lower affinity for DNA 
than its meta counterpart The halfway point of the ti- 
tration with 15 occurred at —24 /iM. Areas of inhibition 
for 15 were similar to those of 14 (Figure 7). 

Discussion 

Ligands Possessing Methylene Chains. Lexitropsins 
3-11 were synthesized to accomplish two tasks. They were 
designed to traverse long sequences of DNA but at the 
same time maintain strong binding. The oligo(iV- 
methylpyrrolecarboxamide) molecules synthesized by 
Youngquist and Dervan (32) were successful in binding to 
longer segments of DNA. Unfortunately, the difference 
in the hydrogen bond periodicity of the latter ligands and 
the DNA decreased the binding affinities of the longer 
peptides. In an effort to overcome the phasing problem, 
compounds with two pyrrole domains connected by a 
flexible methylene chain were designed and synthesized. 
It was theorized that the nonbinding tether would adjust 
its length to allow both pyrrole portions of the ligand to 
simultaneously bind to DNA in a bidentate fashion (figure 
7). Since the pyrrole units have high affinities for AT 
sequences, a potential DNA binding site for the ligands 
consists of either two sufficiently long AT regions sepa- 
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Figure 7. Mono- and bidentate ligation of a bis netropsir 
is shown. 

rated by a short nonbinding sequence or a long coi 
tract of AT base pairs. 

An area of the 139-mer which contains two < 
binding sites separated by a relatively short non 
sequence is located at nucleotides 62-46. The s< 
of the site is 5^AATTTAACTGTGATAAA-3'. 
compounds bind to this sequence such that the 
moieties from a single ligand occupy both AT-ri 
in the sequence, the methylene linker will be pa 
over the nonbinding pentamer at nucleotides 
(CTGTG). In this case, enzymatic cleavage will 
hibited over the entire region. 

As is evident in Figures 3-5, the region from nuc 
62 to 46 does not titrate simultaneously. Inhi bit 
curred at the AT heptamer (62-56) and pentamer 
before they occurred at the central pentamer. T: 
suits indicated that only one of the two binding d 
of the ligand interact with DNA at any one time a 
monodentate ligation is taking place (Figure 7). Si 
pentameric sequence is longer than the longest 
moiety in any of the Uganda, Le^ compound 11, 
havior is expected. 

The initial inhibition regions for compounds 3 
generally broader than those observed for netropsir 
monodentate binding allows the ™ | jgwtf > d portio: 
tend in either direction away from the site of DNA < 
steric effects between the ligand and DNase I shou 
to either "side" of the site. This effect would block c 
by the enzyme and give rise to mfrflutfo n patterns i 
broader than those of netropsin. 

Quantitative footprinting studies involving ne 
and the 139-mer revealed that the primary binding 
89-92 on the restriction fragment has a binding a 
of ~10» M~ l (33, 34). Secondary netropsin sites 
have sequences of the type (A*T) 8 (G-C), have lower i 
constants in the range lOMO* M" 1 . Interestingly, 
lexitropsins 6-11 exhibited initial inhibition patter 
were larger than those of netropsin and compoui 
in the region 100-85 of the fragment This region c 
one strong netropsin site, an AT tetramer, which is : 
by two weaker sites. Thus, it appears that the 
having the longer linkers may be engaged in bic 
ligation by spanning a &C base pair which separa 
AT tetramer (strong site) from flflnlrmg AT trimei 
like netropsin, the binding of a bis lexitropsin.in 
odentate fashion to the high-affinity tetramer re! 
a local high concentration of the unbound portior 
ligand in the vicinity of an AT trimer. This may ca 
adjacent AT trimer to bind the re maining portioi 
bis lexitropsin at ligand concentrations not norm' 
sociated with site loading of AT trimers (33 f 34). I 
with shorter linking groups, e.g., 3-5, are not cap 
spanning the intervening G~C base pair and thus 
only bind in a monodentate fashion to the tetramer 
in this region. 

It may be argued that the best way to show bic 
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Figure 8. Schematic representations of the cis olefin. 12 (a) and 
trans olefin 13 (b) bound to DNA are shown. 

binding would be to employ a restriction fragment having 
a contiguous sequence of A*T base pairs sufficiently long 
to bind the entire ligand in a bidentate fashion. However, 
in this case it becomes difficult to distinguish between 
monodentate and bidentate binding. This is because the 
binding constants for all possible "half-sites" in the long 
sequence are likely to be similar and monoiigation on the 
different restriction fragments present in the mixture will 
give the impression that a single ligand is binding in a 
bidentate fashion to the entire sequence. Thus, the 
aforementioned observation that a weak netropsin site 
adjacent to a strong site, positions 100-85 of the 139-mer, 
binds a bis netropsin as well as the strong site suggests that 
the argument involving a local concentration effect is 
correct and that bidentate ligation is occurring. 

Ligands with Olefin Linkers. Compounds 12 and 13 
were designed to investigate the effects of the presence of 
a rigid linker as well as possible cis /trans effects on bi- 
dentate-ligand binding. The autoradiogram from the 
footprinting experiments involving these compounds 
(Figure 5) showed dramatic differences in inhibition pat- 
terns for the two ligands. Molecular models show that 
monodentate binding of the trans isomer forces the un- 
bound portion of the lexitropsin to be in the vicinity of 
DNA (Figure 8). This effect is greater than with 3-11, 
which have flexible linkers and hence greater degrees of 
conformational freedom. The additional steric constraints 
of 12 may be the reason why the initial inhibition region 
on the 139-mer for the compound appeared to be slightly 
larger than the analogous methylene-linked systems 
(Figures 3 and 4). Similar to the argument advanced for 
bidentate binding of 6-11, one part of lexitropsin, 12, oc- 
cupies a high-affinity site, while the other portion may be 
"bound" to a flanking sequence irrespective of the bases 
present in the sequence. 

The patterns resulting from the cis olefin ligand, 13. were 
not as broad as those for the trans olefin. In fact, they 
resemble those of netropsin. The relatively narrow pat- 
terns for 13 can be explained by considering the geometry 
about the cis olefin linking group. The optimal angle 
between the two netropsin moieties of the bis netropsin 
is not ~ 180° as in the case of 12, but is probably closer 
to ~90° (Figure 8). Molecular models show that 13 can 
bind to DNA in only a monodentate fashion with one 
netropsin moiety bound to the polymer and the other 
directed away from the minor groove of DNA. Because 
the unbound portion of the ligand is not in close proximity 
to the groove floor, the enzyme is able to cleave DNA near 
to the nucleotides actually involved in the binding inter- 
actions, thus yielding narrow inhibition patterns. 

Ligands Containing Aryl Linking Groups. The 
binding affinities of the aryl-linked compounds, 14 and 15, 
were much lower than the affinities of the previously 
discussed ligands. Although 50% protection points for 
3-13 occurred at concentrations ranging from 2 to 3 jiM, 
comparable protection was not seen with the aryl-linked 



compounds until 11-24 mM (Table I). The reduce 
finities of these compounds are probably the rest 
nonbonding interactions present in the structures, 
to steric interference between the amide carbonyl an 
aryl ortho hydrogens, the aryl rings of 14 and 1 
probably perpendicular to the two adjacent amide linl 
This also makes the aryl ring approximately perpendi 
to the pyrrole rings of both netropsin moieties. Whe 
ligands bind to DNA, the amide and the C-3 hydrc 
of the pyrrole units can be directed in toward the t 
groove just as they are in the netropsin-DNA com 
However, the perpendicular orientation of the aryi 
blocks deep penetration of the ligand into the minor g 
of DNA, thereby reducing binding affinities through 
effects. While there has been a proposal that the o\ 
plane piperazine ring of Hoechst 33258 serves as a c 
minant for a widened minor groove, i.e., a GC site (35 
has been disputed in a reexamination of the Hoechst < 
complex with d(CGCGAATTCGCG)* (36). Cert 
there is no evidence from the present footprinting data 
this is the case for 14 and 15. > 

In summary, the conformational freedom associatec 
two netropsins linked by a flexible methylene tether i 
ligand binding in both a mono- and a bidentate fas 
In the restriction fragment studied, bidentate ligation 
place by forcing the half-site with the lower bindinf 
stant to accept ligand at the same time that the haJ 
with the higher binding constant binds a bis netrc 
This is probably due to a local high concentration c 
of the "feet" of the ligand, causing an otherwise wea 
"to exhibit binding. For compounds possessing rigid te 
for example, cis and trans ethylene mono versus bide 
binding is influenced by the relative dispositions of tb 
feet of the bis netropsin. For the cis structure it : 
possible for both "feet" to simultaneously bind to i 
However, the trans geometry forces both "feet" c 
ligand to be near DNA. If one is bound to a high-af 
sequence, the other may extend over or be bound to a 
affinity site. Bis netropsins possessing aryl linking g 
have reduced DNA binding affinities. This is most 
due to the aryl groups, which are not coplanar wit 
netropsin moieties, thus blocking the ligand from d 
penetrating the minor groove of DNA, 

The relationship between the tethered netropsin 
the phasing problem (37) was not directly addressed i 
study. Since the restriction fragment possesses a h< 
geneous sequence and not all sites on the fragment 
ligand with equal affinities, possible mismatches bei 
hydrogen bond donor and acceptor sites on the DN, 
ligands cannot be detected in the footprinting 
Studies involving a more uniform binding substrate 
poly(dAdT)'poly(dAdT) and the bidentate ligands 
in progress), would appear to directly address this 

Efforts directed toward incorporating these mol< 
recognition parameters in the design of longer lexitn 
to target general predetermined sequences in the h 
genome will be reported in due course. 
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